Can we do better than Co-Citations? - Bringing Citation Proximity Analysis from idea to practice in research article recommendation
Authors
Abstract
In this paper, we build on the idea of Citation Proximity Analysis (CPA), originally introduced in [1], by developing a step-by-step, scalable approach for building CPA-based recommender systems. As part of this approach, we introduce three new proximity functions that extend the basic assumption of co-citation analysis (the more often two articles are co-cited in a document, the more likely they are related) to take the distance between the co-cited documents into account. To investigate whether CPA can outperform co-citation analysis in recommender systems, we built a CPA-based recommender system from a corpus of 368,385 full-text articles and conducted a user survey as an initial evaluation. Two of our three proximity functions used within CPA outperform co-citations on our evaluation dataset.
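The difference between plain co-citation counting and a proximity-weighted score can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the input format (each citing document as a list of `(cited_id, position)` pairs), the function names, and in particular `inverse_distance`, which is a hypothetical weighting and not one of the three proximity functions proposed in the paper.

```python
from collections import defaultdict
from itertools import combinations


def cocitation_counts(documents):
    """For each pair of cited articles, count the citing documents that cite both."""
    counts = defaultdict(int)
    for citations in documents:
        cited = sorted({ref for ref, _ in citations})
        for a, b in combinations(cited, 2):
            counts[(a, b)] += 1
    return dict(counts)


def inverse_distance(distance):
    """Hypothetical proximity function (an assumption, not from the paper)."""
    return 1.0 / (1.0 + distance)


def proximity_scores(documents, proximity=inverse_distance):
    """Sum a proximity weight per citing document, using the closest
    pair of citation positions for each pair of cited articles."""
    scores = defaultdict(float)
    for citations in documents:
        closest = {}
        for (ref_a, pos_a), (ref_b, pos_b) in combinations(citations, 2):
            if ref_a == ref_b:
                continue  # the same article cited twice is not a co-citation
            pair = tuple(sorted((ref_a, ref_b)))
            distance = abs(pos_a - pos_b)
            if pair not in closest or distance < closest[pair]:
                closest[pair] = distance
        for pair, distance in closest.items():
            scores[pair] += proximity(distance)
    return dict(scores)


# Toy corpus: positions are illustrative offsets within each citing text.
docs = [
    [("A", 10), ("B", 12), ("C", 500)],
    [("A", 40), ("C", 45)],
]
counts = cocitation_counts(docs)
scores = proximity_scores(docs)
```

In this toy corpus, A and C are co-cited in two documents and A and B in only one, so plain co-citation counting ranks A and C as more related. Under the assumed inverse-distance weighting, A and B score higher because their citations sit almost next to each other, which is the kind of signal CPA is meant to capture.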
Similar resources
Paper recommendation using citation proximity in bibliographic coupling
Research paper recommendation has been a hot research area for the last few decades. Thus far, numerous different paper recommendation approaches have been proposed. Some of these include methods based on metadata, content similarity, collaborative filtering, and citation analysis, among others. Citation analysis methods include bibliographic coupling and co-citation analysis. Much research has...
Citation Proximity Analysis (CPA) – A new approach for identifying related work based on Co-Citation Analysis
This paper presents an approach for identifying similar documents that can be used to assist scientists in finding related work. The approach called Citation Proximity Analysis (CPA) is a further development of co-citation analysis, but in addition, considers the proximity of citations to each other within an article's full-text. The underlying idea is that the closer citations are to each othe...
Identifying Related Documents For Research Paper Recommender By CPA and COA
This work-in-progress paper introduces two new approaches called Citation Proximity Analysis (CPA) and Citation Order Analysis (COA). They can be applied to identify related documents for the purpose of research paper recommender systems. CPA is a variant of co-citation analysis that additionally considers the proximity of citations to each other within an article’s full-text. The underlying id...
Identifying Related Work and Plagiarism by Citation Analysis
This updated and revised paper gives an overview of my PhD research. It focuses on two newly developed approaches. Citation Proximity Analysis (CPA) allows the identification of related work by analyzing the co-occurrence of citations within documents. In contrast to co-citation analysis various factors, such as the proximity of citations to each other, are taken into account. The second approa...
The Effects of Co-citation Proximity on Co-citation Analysis
In this paper we investigate the effects of co-citation proximity on the quality of co-citation analysis through four experiments of co-citation instances found in full-text scientific publications. First, we compared the distributions of co-citation instances at four levels of proximity in journal articles with the traditionally used article-level co-citation counts. Second, we analyzed how co...